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(57) Abstract 

A high-speed circuit that performs unsigned mode, two's complement mode, and two types of mixed mode multiplication- 
accumulation with equal facility. The circuit is an array (10) constructed from ten different adder elements (FA IS, FA1 A, FA2A, 
FAC, FACA, FACC, FAAC, FAAC3, HAC, and HAC2, which correspond to Figures 13-22, respectively). The array has two 
multiplier input operands (X and Y) and one accumulator term (Z), the three of which may be expressed as binary power expan- 
sions. Final addition of sumout and carryout terms of the array (10) is performed by final adder (1 1), which may comprise any of 
several possible adder configurations, including full carry lookahead, carry select, and conditional-sum type adders. Speed is ac- 
complished through the use of high-speed adder elements having few gate delays, and by summing all even array rows together 
and all odd rows together, then adding the even sum with the odd sum in the final adder (1 1) using a Wallace tree technique. The 
circuit incorporates a high degree of regularity and interconnectivity, which facilitates compact circuit layout 
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UNIVERSAL MULTIPLIER-ACCUMULATOR 



Field of the Epventiop 

This invention generally relates to high-volume high- 
speed digital parallel processing, in particular a 
multiplier-accumulator integrated circuit array that will 
5 accept input operands in two's complement, unsigned 

magnitude, and mixed modes. 



Background of the Invention 

A multiplier-accumulator is a computational device 
that multiplies two multiplier input terms and sums the 

10 product with an accumulator term, providing a final output 

term. A parallel multiplier has the characteristic of 
operating on multiple bits within the terms simultaneously, 
that is, in parallel. High speed parallel multipliers and 
multiplier-accumulators and their efficient integration 

15 into silicon or some other semiconductor substrate have 

been of interest since the early days of digital computers. 

Many arithmetic circuits which are widely used in the 
digital arts are designed for use with two's complement 
binary signals. While such signals and the associated 

20 circuits are particularly well adapted for performing 

efficiently under many circumstances , two's complement 
multiplication often requires the use of specialized 
circuits for correcting errors or characteristically 
incomplete results. Attempts have been made to reduce 

25 specialized circuitry in two's complement arithmetic 

circuits. See, for example, US patent number 3,866,030 by 
inventors Baugh and Wooley for a two's complement parallel 
array multiplier. 

A multiplier-accumulator that can operate equally well 
30 acr ss two's complement, unsigned, and mixed modes is 
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d sirable. A first obj ctive of the present invention is, 
therefore, to provide a multiplier-accumulator that is 
universal across all modes. 

Although a multiplier-accumulator has speed advantages 
5 when compared to equivalent circuits, more speed is always 

desirable. A second objective of the present invention is, 
therefore, to provide a multiplier-accumulator that has 
high operational speed. 



It has long been recognized that an arrayed circuit 
structure comprised of identical or nearly-identical 
components facilitates circuitry layout (whether the layout 
is performed by hand or with computer-aided design tools) . 
In addition, an arrayed circuit structure comprised of 
regular components lends itself local interconnects, which 
minimizes parasitic capacitance and resistance, thus 
increasing circuit speed- A third objective of the present 
invention is, therefore, to provide a universal multiplier- 
accumulator having an array structure with a high degree 
of component regularity, in order to facilitate circuit 
layout and minimize the length of interconnects. 



Summary pff t:he Inve^tipn 



The invention is a circuit that can perform universal 
multiplication-accumulation. It is universal in the sense 
that it operates with equal facility on operands of 
unsigned magnitude, operands in a two's complement mode, 
and operands of mixed modes. 



By deriving special compensation expressions for terms 
having negative weight for each multiplication mode (i.e., 
two's complement, unsigned magnitude, and mixed modes) , and 
by grouping these compensations in a particular manner, it 
is possible to eliminate the need for the generation of 
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negative partial products. Further, by making these 
compensation expressions conformal across all 
multiplication modes, specialized circuitry is minimized 
and circuit regularity is maximized. Similar adder 
5 elements are arrayed, with all specialized circuitry being 
implemented as circuit modifications on some of these 
elements, thus improving circuit regularity and 
interconnect ivity . 

Fast adder elements are included to improve overall 
10 speed of the invention. Speed is further increased with 
the use of multiple-row addition techniques. 

Brief pescyAptjon of the Drawings 

Figures la-lc depict multiplier input operands X and 
Y and accumulator input 2 in binary form (e.g., in Figure 
15 la, m=8 for an 8-bit word, and each a is a bit in that 
word) ; 

Figures 2a-2o depict expressions that yield magnitudes 
of two's complement terms X te , Y tc , and 2 tc ; 

Figures 2d-2f depict expressions that yield magnitudes 
20 of unsigned magnitude terms X w , Y w , and Z m ; 

Figures 3a-3d depict expressions for the products 

*tc=*tc Y tC P ™ =X «« *A= X tc and *fT* m Y tC 

respectively; 

Figures 4a-4e, 5a-5f , 6a-6f , and 7a-7e depict two-row 
25 compensation derivations for L, N, O, and Z, respectively? 

Figures 8-11 depict multiplication-accumulation in 
unsigned mode, two's complement mode, and mixed modes A and 
B ("A" figures show a standard representation, M B" figures 
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show the representation condensed and slightly rearranged, 
and "C" figures show an array configuration that will 
perform the multiplication-accumulation) ; 

Figure 12a is block diagram of the preferred 
5 multiplier-accumulator; 

Figure 12b is a schematic of the array portion of the 
preferred multiplier-accumulator; 

Figures 13-22 detail preferred adder elements and 

their operation ("A" figures are schematics of the 

10 elements, while "B" figures are logical symbols, which are 

used in Figure 12 , for example) ; 

Figure 23 depicts a logic diagram for a preferred 
Exclusive-OR (XOR) gate; and 

Figure 24 depicts the logic circuitry which generates 
15 control signals TCA, TCB, TCC, and MXM. 



Preferred E mbodiment of the Invention 

The preferred embodiment of the universal multiplier- 
accumulator makes use of certain algorithms, the 
understanding of which is essential to an understanding of 
20 the multiplier-accumulator. Therefore, a derivation of 

these algorithms follows. 

1 . Derivations 

The universal multiplier-accumulator has two 
multiplier input operands X and Y (also called product 
25 terms) and one accumulator term Z, expressed in Figures la- 

ic as binary power expansions. The two possible 
accumulator input modes are two's complement and unsigned 
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magnitude. In this disclosure, subscript "tc" indicates 
two's complement mode, and subscript "uns" indicates 
unsigned magnitude mode. For example, X tc indicates X 
interpreted as a two's complement number. Further, in this 
5 disclosure, a term may be represented as a vector. For 

example, X (shown as a binary power expansion in Figure 1) 
may also be represented as the vector (Vt'Vz''--^)' Y as 
( b n-i ' b n-2' • • • f *>o) ' and z as ( o il#n . 1 , c^, . . . , c 0 ) . 
Multiplication is indicated by the symbol " " and addition 
10 is indicated by the symbol ■'+". Logic operations are 

indicated in capitals, such as in the expression "a AND b", 
for example. 

Figure 2a is a two's complement expression of X: when 
X is to be interpreted as X tc , the expression in Figure 2a 
15 will yield the magnitude of X^. Figures 2b-2c show 

corresponding expressions for Y tc and Z tc . Figures 2d-2f 
show similar expressions for X m , Y M , and Z m . 

Since each of the input operands X and Y can be in 
two's complement mode or unsigned magnitude mode, there are 

20 four possible multiplication modes for the operation P=X Y. 

These are two's complement mode (* tc =X tc Y tc ) , unsigned 
magnitude mode (P^ Y m ) , mixed mode A (P A =X tc Y^) , and 
mixed mode B (P^X^ Y tc ) . Figures 3a-3d, respectively, show 
expressions for each of these multiplication modes, based 

25 on Figures 2a-2f . 



Note in Figures 3a-3d that each product includes 
different sign combinations of expressions L, M, N, and O. 
That is, P tc »+IrWf-N-0, P^L+M+N+O, P A =-L+M+N-0, and 
P B =*-L+M-N+0. This is a significant observation and has 
30 bearing on the design of the preferred embodiment. 

It is well known that binary addition of a two's 
complement form of a word is equivalent to its unsigned 
binary subtraction. Addition is more straightforward and 
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readily implemented than subtraction. To obtain a two's 
complement form, a word is inverted and a 1 is added to it. 
In the above-listed multiplication modes, terms L, N, and 
O are subtracted. In the preferred embodiment, "two-row 
5 compensations" are derived to aid in these subtractions. 

The two-row compensation L^, for example, is a pair of rows 
of bits that when summed with an operand, gives a final 
accumulation that is as if L was subtracted. The two-row 
compensation contains two's complement arithmetic in a form 
10 that allows efficient design of the subject multiplier- 
accumulator. 

Figure 4a shows the expression for term L, written the 
same as in Figure 3a, for example. L may also be expressed 
as a summation of the two rows in Figure 4b. Figure 4c 
15 shows L inverted and a 1 added to obtain a two's complement 

inversion, rewritten in Figure 44. Figure 4e shows L in a 
two's complement binary matrix form. Figure 4e is the two- 
row compensation I*^. 

Two-row compensation is generated as shown in 

20 Figures 5a-5f . Figure 5a expresses term N, the same as in 

Figure 3b, for example. N may also be expressed as shown 
in Figure 5b. Note that b^ is a binary constant, a 1 or 
0. If b^el, then N reduces to Figure 5c. If b^-0, then 
K and its two's complement are both zero. For b n „ 1 =l, the 
25 two's complement of N is shown in Figure 56. Using this 

information, a generalized expression for the two's 
complement of N can be created, shown in Figure 5e. The 
expression of Figure 5e is then converted to K^, shown in 
Figure 5f . 

30 Term O (Figure 6a) is similar to N. The derivation of 

°2rc shorn* in Figure 6f, is therefore derived using a 
similar technique. 
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Since the invention accumulates as well as multiplies, 
two-row compensation Z^ for accumulator term Z must also 
be derived, since Z may be positive or negative in any 
multiplication-accumulation mode involving two»s 
5 complement. This is shown in Figures 7a-7e. Note that 
when c B|m . 1 =l f Z<0, and when c^^^O, Z>0. 

With reference to Figures 8-11, the four multiply- 
accumulate modes are now detailed for 8-bit X and Y terms 
(m=8, n=8) . In each figure, a somewhat standard 
10 representation of the operation is shown, and then an 
equivalent, preferred arrangement. The preferred 

arrangements are readily adaptable to embodiment in a 
single circuit. M remains the same across all cases. 

Figure 8a illustrates an unsigned mode multiplication- 
15 accumulation, that is, X m Y^+Z^. Elements of L, M, N and 

O are shown in their proper columns for addition. Figure 
8b shows the same algorithm, but with O moved over the 
upper left edge of M. Figure 8c shows this operation in an 
array configuration. The correct result is obtained if 
20 addition is diagonally performed. 

Figure 9a illustrates a two^ complement mode 
multiplication-accumulation, that is, X tc Y tc +Z tc . Recall 
from Figure 3a that M and 0 are subtracted in this mode. 
Instead of subtracting M and O in the preferred embodiment, 

25 two-row compensations K ac and are summed. Because Z may 
be positive or negative in this mode, Z ac is also summed. 
Figure 9b shows the same derivation, but with all ones in 
the two-row compensations pre-summed and all but two 
elements of O ac moved over the upper left edge of M. For the 

30 specific case of a multiply-only algorithm (no 

accumulation) , the inventive method reduces to the familiar 
Baugh-Wooley two's complement multiplier method. Figure 9c 
shows the operation in an array configuration. The correct 
result is obtained if addition is performed diagonally. 
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Mixed mode A (X^ is illustrated in Figure 10a. 

As shown in Figure 3c, L and O are subtracted in this mode. 
Ther fore 1*^ and O ac are added. z ac is also added because 
Z may be positive or negative in this mode. Figure 10b 
5 shows the corresponding preferred arrangement. As in two^ 
complement mode in Figure 9b, all ones in the two-row 
compensations are pre-summed. As above, Figure 10c shows 
an array configuration. 

Finally, Figure 11a illustrates mixed mode B 
10 ( x w Y tc +Z tc> • As skowi in Figure 3d, L and N are subtracted 

in this mode. Therefore 1^. and are added. Z^ is also 
added because Z may be positive or negative in this mode. 
Figure 11c shows a corresponding array configuration. 

2 • Embodiment 

15 The preferred embodiment of the invention includes an 

n by a multiplier array having multiple- function adder 
elements to perform the above-derived algorithms as 
represented in the array configurations of Figures 8c, 9c, 
10c, and lie. For the sake of illustration, an 8 by 8 

20 multiplier-accumulator according to the invention is shown 
in Figures 12 a and 12b. The preferred embodiment merges 
the four multiplication-accumulation modes into a single 
arrayed set 10 of adder elements 100-107, 110-117, 120- 
127, 130-137, 140-147, 150-157, 160-169, 170-179, and 180- 

25 188, detailed in Figures 13 through 23, of which the "A" 

figures show a schematic, and the "B M figures show a 
corresponding logical symbol used in Figure 12b. It is 
noted that some of the signals shown in the logical symbols 
are to interpreted differentially, and some 

30 nondifferentially. For example, SIN in Figure 14b is 
differential (representing true SIN and complement SIN*, 
shown in Figure 14a), while input a f is not. 
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The elements will now b described, 
chart is provided for cross-reference: 



The following 



Adder 
Type 



Figure 



Elements 



10 



15 



FA18 

FA1A 

FA2A 

FAC 

FACA 

FACC 

FAAC 

FAAC3 

HAC 

HAC2 



13 

14 

15 
16 
17 
18 
19 
20 
21 
22 



110, 120, 130, 140, 150, 160, 170, 

172-177, 180-187 

111-116, 121-126, 131-136, 

141-146, 151-156, 178 

100-106, 171 

161-166 

107, 117, 127, 137, 147, 157 

167 

168 

169 

179 

188 



20 



FA18 (Figure 13) is a 3-bit fully differential full- 
adder cell, with propagation times of 2 gate delays from 
sum-in to sum-out and 1 gate delay from carry-in to carry- 
out, when the preferred EXOR circuit of Figure 23 is used. 



FA1A (Figure 14) performs: 
(a- bj) + SIN + CIN. 

FA2A (Figure 15) performs: 
25 (a^ b 0 ) + (a, b t ) + CIN, 



FAC (Figure 16) performs: 
((A^. b ml AND TCB) OR (a^j b n . t AND TCB* ) ) + 
SIN + CIN, where TCB* is the inverse of TCB, 



30 



<« 



FACA (Figure 17) performs: 
B^j AND TCA) OR ( 



, h n _ } AND TCA*)> 



+ CIN. 
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FACC (Figure 18) performs: 

<<Vl B n-2 AND TCA > OR < a «-l b n-2 AND TCA*)) + 
((A^p b^, AND TCB) OR (a^ b^ AND TCB* ) ) + CIN. 

FAAC (Figure 19) performs: 
5 { (a.., b^ AND MXM) OR (NOT (a^ b^) AND MXM*)) + 

TCA + b n . 1 TCB. Note here that A B+l is logically 
equivalent to NOT (A B) . 

FAAC3 (Figure 20) performs: 
((C^, AND TCC) OR (C B|H1 . 1 AND TCC*) + A^., TCA + B^, TCB. 

10 HAC (Figure 21) performs: TCC -I- SIN. 

HAC2 (Figure 22) performs: A + B. 

FAC, FACA, FACC, FAAC, FAAC3 are all adder elements 
that perform more than one type of addition, and are 
controlled by signals TCA, TCB, TCC, and MXM. The 
15 generation of TCA, TCB, TCC, and MXM is detailed in Figure 

24 and in the chart below. 

Mode TCA TCB TCC MXM 



Unsigned 0 0 0 0 

20 Mixed Mode B 0 10 1 

Mixed Mode A 10 0 1 

Two's Complement 1110 

The above elements when connected as shown in Figure 
12 perform together as a universal multiplier-accumulator, 
25 capable of multiplying in unsigned mode, two's complement 

mode, and mixed modes A and B. 



Figure 12b further shows the preferred embodiment 
having multiple data input bits (logical 0) , (logical 
1) , , bp-1^, and carry in bits c 0 -c 15 . Output bits 
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include carry-out bits c 0 -o 15 and sum bits s 0 -s u . MXM, 
TCA, TCB, and TCC are control inputs, and NC denotes a no- 
connect. Sum-out and carry-out bits s 0 -s 16 and co 0 -co 15 are 
summed in final adder IX of Figure 12a, 

5 Simplicity and interconnect ivity are improved by 

designing similarity across all elements. For example, 
each full-adder shares the same basic primitive cell (FA1S) 
with compensation logic included as required. Some adder 
elements must be able to perform more than one function. 
10 It is primarily for this reason that several adder types 
are used in the preferred embodiment. 

For example, element 168 of Figure 12b multiply- 
accumulates a 7 b 7 in unsigned mode, a 7 by+a^^ in two's 
complement mode, Kj B 7 +a 7 in mixed mode A, and A 7 Bj+bj in 
15 mixed mode B (compare Figures 8c, 9c, 10c, and 11c, 
respectively) . 

The partial products within term H for all 
multiplication modes are performed within elements 100- 

106, 200-206, 300-306, 400-406, 500-506, 600-606, and 700- 
20 706. For example, the partial product b 0 is implemented 

by FA2A element 100 when connected as shown in Figure 12b. 

Term O of Figure 8a is shown in Figure 8b at the upper 
left side of M. These terms are implemented by elements 

107, 117, 127, 137, 147, 157, and 167. For example, term 
25 a? b 0 in Figure 8b is implemented by element 107 when 

connected as shown. 

Term N of Figures 8a and 8b is implemented by elements 
160-166. For example, term a<, b 7 is implemented by element 
160 when connected as shown. 

30 These terms in other modes are similarly accomplished 

by the preferred embodiment. 
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Elements 108 and 171 are now further elucidated. Term 
a.., is added in the column of in two's complement mode 
and mixed mode A. Term b n . 1 is added in the column of in 
two's complement mode and mixed mode B. In the preferred 
embodiment, m=n=8, so a7 and/or b7 are added in the column 
of c7 in all but unsigned mode. The addition of term a 7 is 
accomplished by element 10B. Note that 108 outputs a 
differential signal. The addition of term *>j is 
accomplished by element 171. This arrangement allows 
proper addition of a^, and b fl _ 1 even if m»/n. 

Final addition of the sumout and carryout terms of the 
preferred array 10 is performed by final adder 11, shown in 
Figure 12a, which comprises any of several possible adder 
configurations, including, for example, full carry look 
i ahead, carry select, and conditional-sum type adders. 

The preferred embodiment is fast when fast adders 
(such as the preferred) are used. Speed is further 
improved in the preferred embodiment in Figure 12 by 
summing all even rows together, summing all odd rows 
) together, and adding the even sum with the odd sum in final 

adder 11 (a Wallace tree technique) . It is noted that 
three or more groups of rows can similarly be summed, but 
the apparent speed improvement thus gained is lost due to 
increased parasitic capacitances in the longer 
5 interconnections. A path limit of 22 mils or less is 

deemed desirable to limit parasitics and optimize speed. 
Short interconnections therefore makes Wallace tree 
summation of two groups of rows preferable. 
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Many variations may be made t the embodiment without 
making it a different invention. Different adder element 
designs may be used, as well as alternate EXOR designs. 
Although the preferred embodiment is integrated into a 
semiconductor substrate using CMOS techniques, other 
fabrication technologies might be used. The circuit may be 
constructed discretely and still embody the same invention. 
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1. Apparatus to obtain a product P from terms X and Y and 
accumulate it with a term 2, the result for use in a 
digital processing system, wherein P is one of the group 
5 P tc =x tc Y tc/ P A=X tc K„, and P B =X W Y te , wherein X 

is representable as vector (a.,,^ a,,) and Y as vector 

( b n-i ' b n-2' comprising: 

L means, to form L « a B _ 1 b nM 2 1 "*"" 2 ; 

M means, to form M = ss a 5 bj 2j+j for i from 0 to m-2 and j 
10 from o to n-2; 

N means, to form H = S a, 2"" 1 *' for i from 0 to m-2; 

O means, to f orm o = s a,., bj 2~^ for j from 0 to n-2; 

compensation means, to form one or more compensations L^, 
and °2»c ln the event that one or more of t, k, and o, 
15 respectively, are negative; 

P means, to form P of one of the group P,„, p^, p a , and p B , 
wherein V m is formed by summing I», M, N, and O, P fc is 
formed by summing l, m, n^, and o ac , p a is formed' by 
summing i^, m, n, and O a , and P„ is formed by summing L^, 
M ' H ac' and °»* and 



20 
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accumulation means, to accumulate P with Z; 



wherein all summations are done in an interconnected array 
of substantially identical 3 -bit adders arranged in a 
Wallace tree. 



5 2. Apparatus to form a product P of X=(a m . 1 , a^, . • . ,a 0 ) and 
Y=(b nM ,b n _ 2 , . . . , b 0 ) , and accumulate P with a term Z, wherein 
the result is for use in a digital processing system, and 
wherein each of X and Y may be in two f s complement mode or 
in unsigned mode, comprising: 

10 first means, to form terms a f bj for all i and j yielding 

positive values; 

second means, to form a combination of positive terms 
equivalent to any combination of terms a f bj which is 
negative ; 

15 third means, to selectively combine said terms formed by 

said first means and said combination of positive terms 
formed by said second means, to form P; and 

accumulation means, to accumulate P with Z; 
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wherein all summations are done in an interconnected array 
of substantially identical 3-bit adders arranged in a 
Wallace tree. 



3. Apparatus to form a product *=(P** n .i*P m «.n-2' • • • 'Po) of 
5 X=(a Brl ,a ni _ 2r ... f a 0 ) and Y^b^ ,b n . 2 , . . . , b 0 ) , and accumulate P 

with a term 2, wherein the result is for use in a digital 
processing system, and wherein each of X and Y may be in 
two's complement mode or in unsigned mode, comprising: 

means to complement terms a i and bj to obtain terms A ? and 
10 B jf respectively, for 0<i<m-l and 0<j<n-l; 

means to form terms a^., b n . 1# a f b jf a, b nM , A f b nM , a^, bj, 
and a m . 1 B jr for 0<i<m-2 and 0<j<n-2; 

means to form a term which is the complement of said term 

a m-1 *W 

15 means to form at least one term having the constant value 

1; and 



means to selectively combine said terms so formed, 
including: 
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means to form p k for 0<k<m+n-2, to form P for X and Y 
both in unsigned mode, by summing said terms a, bj 
where i+j=k, and carries from the sum for k-1; 

means to form p k for 0<k<m+n-l, to form P for X and Y 
5 both in two's complement mode, 

by adding to the sum for each k for 0<i<m-2 and 
0<j£n-2 said terms a, b J# a^, b n _,, A f b n _,, a^ B,, 
and a^,, b n . 1# wherein the sum of the subscripts 
within each term is equal to k, 

10 by adding said terms a m . 1 and b n _-, to the sum for 

k=m+n-2 , 

by adding said terms A m _ 1 , B n-1 , and 1 to the sum 
for k=m+n-l, and 

by adding carries from the sum for each k-1 to 
15 the sum for each k; 

means to form p k for 0^k^m+n-l, to form P for X in 
two's complement mode and Y in unsigned mode, 

by adding to the sum for each k for 0<i<m-2 and 
0<j<n-2 said terms a- b j# a^, a f b nM , a^, B jf and 
20 B n . 1# wherein the sum of the subscripts within 

each term is equal to k, 



10 
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by adding said terms a^ and 1 to the sum for 
k=m+n-2 , 



by adding said terms A m . 1 and 1 to the sum for 
k=m+n-l , and 

by adding carries from the sum for each k-1 to 
the sum for each k; 

means to form p k for 0<k<m+n-l, to form P for X in 
unsigned mode and Y in two's complement mode, 

by adding to the sum for each k for o^i^m-2 and 
0£j£n-2 said terms a f bj, b n . t , A f b nM/ a^ b jW and 
B n ^, wherein the sum of the subscripts within 
each term is equal to k, 

by adding said terms b nM and 1 to the sum for 
k=m+n-2 , 



15 by adding said terms B nM and 1 to the sum for 

k=m+n-l, and 

by adding carries from the sum for each k-1 to 
the sum for each k; and 



accumulation means, to accumulate P with Z; 
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wherein all summations are done in an interconnected array 
of substantially identical 3-bit adders arrang d in a 
Wallace tree. 

4. Apparatus to obtain a product P and accumulate it with 
5 a term Z, the result for use in a digital processing 

system, wherein P is one of the group P tc =x tc Y tc , 
p « =x « Y i™' p A"*tc Y i«' and p b =x i™ Y tc wherein X is 
representable as vector (ap^a^, • • • ' a o) and Y as vector 
( b n-i ' b n-2' • • • ' b o> ' comprising: 

10 first means, to form 

* " b n-1 a™' 2 , 

M = EE a. bj 2 i+j for i from 0 to m-2 and j from 0 to 
n-2, 

N = E a f b nM 2 n " H! for i from 0 to m-2, and 
15 O = E a^ b. 2 m " 1+j for j from 0 to n-2; 

compensation means, to form one or more compensations L^, 

N 2rc and °2rc in the event that one or more of L, N, and O, 
respectively, are negative; and 

second means, to form P of one of the group P,„/ P tc , P A , and 
20 P B , wherein P^ is formed by summing L, M, N, and O, P tc is 

formed by summing L, M, K ac/ and O^, P A is formed by 
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summing L 2RC , M, N, and O m , and P B is formed by summing L^, 
M, Ha,., and ; 

third means to select one of the group P,„, P tc , P A , and P B , 
to provide as P; and 

5 accumulation means, to accumulate P with Z; 

wherein all summations are done in an interconnected array 
of substantially identical 3-bit adders arranged in a 
Wallace tree. 



5. Apparatus to obtain a product P for use in a digital 
10 processing system, wherein P is one of the group P te =X t c Y te' 

V*tc*«' and *.=*«*tc wherein X is 
representable as vector (a a . 1f a B . zf . . . ,a„) and Y as vector 
( b n-i ' b n-2» • • • ' b o> ' comprising: 

L means, to form L = a m _., b n ., 2 ; 

15 M means, to form H ■ SS a 5 b } 2 Hj for i from 0 to m-2 and j 

from 0 to n-2; 

N means, to form H = S a, b nM 2"- 1+f for i from 0 to m-2; 



O means, to form O - S v, bj 2-" 1+J for j from 0 to n-2; and 
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P means, to form P of one of the group P^, P tc , P A r and P B , 
wherein P w is f rmed by summing L, M, N, and O, P tc is 
formed by summing L and M and subtracting N and O, P A is 
formed by summing M and N and subtracting L and O, and P B is 
5 formed by summing M and O and subtracting L and N. 



6* The apparatus of Claim 5, further comprising means 
within each of said L, N, and O means, to form 
compensations L ac , N^, and O^, respectively, so that any 
subtraction of L, N, or O within said P means is obtained 
10 by a summation of I*2r C , N ac , or 0 ac , respectively. 



7. Apparatus to obtain a product P for use in a digital 
processing system, wherein P is one of the group Ptc=X tc Y tc , 

p u» =x ™ Y imm' * A =x tc y imw and p b =x i» Y tc wherein X is 
representable as vector (a^ , a ffl . 2 , . . . , a^ and Y as vector 
15 (b n . 1f b n . 2 ,...,b 0 ), comprising: 

L means, to form L = a^., b n _., 2 ro * n " 2 ; 

M means, to form M = EE a f h } 2 Hj for i from 0 to m-2 and j 
from 0 to n-2 ; 

N means, to form N = E a f b^ 2."" 1 * 1 for i from 0 to m-2; 
2 0 O means, to form O = E a^., h } 2 n ~ U] for j from 0 to n-2; 
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compensation means, to form one or more compensations L^, 
N^, and O ai: , in the event that one or more of L, N, and O, 
respectively, are negative; and 

P means, to form P of one of the group P m , P tc , P A , and P B , 
5 wherein P m is formed by summing L, M, N, and O, P tc is 
formed by summing I*, M, N^, and O^, P A is formed by 
summing L^, M, N, and O^, and P B is formed by summing L^, 
M, N^, and O. 

8. Apparatus to form a product P of X=(a m . 1r a m . 2 , . . . ,a 0 ) and 
10 Y=(b n . 1f b n . 2 , • . . ,b 0 ) , wherein P is for use in a digital 
processing system, and wherein each of X and Y may be in 
two's complement mode or in unsigned mode, comprising: 

first means, to form terms a { bj for all i and j yielding 
positive values; 

15 second means, to form a combination of positive terms 
equivalent to any combination of terms a f bj which is 
negative ; and 

third means, to selectively combine said terms formed by 
said first means and said combination of positive terms 
20 formed by said second means, to form P. 
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9 Apparatus to form a product ^(P^-i-P^' • • * of 

xlta^ a 0 ) and Y-Cb^b^ b 0 > , wherein P is for 

use in a digital processing system, and wherein each of X 
and Y may be in two's complement mode or in unsigned mode, 
5 comprising: 

means to complement terms a, and bj to obtain terms A f and 
Bj , respectively, for OSiSm-l and 0<j<n-l; 

means to form terms V i b n-i' a ' b J' a » * ^ 

and a.., B jf for 0<i<m-2 and 0£j£n-2? 



10 means 

a «-1 b n-1' 



to form a term which is the complement of said term 



means to form at least one term having the constant value 
1? and 

means to selectively combine said terms so formed. 



15 



10. The apparatus of Claim 9, wharein. for X and Y both in 
unaignad moda, aaid -aana for salactivaly coahining 
includes: 

means to form p k for 0£kSm + n-2, by summing said terms a, bj 
where i+j=k, and carries from the sum for k-l. 
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11. The apparatus of Claim 9, wherein, for X and Y both in 
two's complement mode, said means for selectively combining 
includes : 

means to form p k for 0<k<m+n-l, 

5 by adding to the sum for each k for 0<i£m-2 and 0<j<n- 

2 said terms a f b j# a^, b nM , A, b n .,, a,,., B jf and a.., b n _ 
wherein the sum of the subscripts within each term 
is equal to k, 

by adding said terms a |n . 1 and b n . 1 to the sum for 
10 k=m+n-2 , 

by adding said terms A^, B^, and 1 to the sum for 
k=m+n-l, and 

by adding carries from the sum for each k-1 to the sum 
for each k. 

15 12. The apparatus of Claim 9, wherein, for X in two's 
complement mode and Y in unsigned mode, said means for 
selectively combining includes: 

means to form p k for 0<k<m+n-l, 
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by adding to the sum f r ach k for 0£i£m-2 and 0<j£n- 
2 said terms a f b j# a n . 1# a f b n _ 1# a^ B., and A m . 1 B n _,, 
wherein the sum of the subscripts within each term is 
equal to k, 

5 by adding said terms a^ and 1 to the sum for k=m+n-2, 

by adding said terms A^.-, and 1 to the sum for k=m+n- 
1; and 

by adding carries from the sum for each k-1 to the sum 
for each k. 



10 13. The apparatus of Claim 9, wherein, for X in unsigned 

mode and Y in two's complement mode, said means for 
selectively combining includes: 

means to form p k for 0<k<m+n-l, 

by adding to the sum for each k for 0<i<m-2 and O^j^n- 
15 2 said terms a, h } , b n . 1# A 1 b nM , a roM b jf and A m . 1 B n . 1f 

wherein the sum of the subscripts within each term is 
equal to k, 

by adding said terms b^,, and 1 to the sum for k=*m+n-2, 
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by adding said terms and 1 to the sum for k=m+n- 
1; and 



by adding carries from the sum for each k-1 to the sum 
for each k. 



5 14. The apparatus of Claim 9, wherein said means for 

selectively combining includes: 

means to form p k for 0£k<m+n-2, to form P for X and Y both 
in unsigned mode, by summing said terms a, bj where i+j=k, 
and carries from the sum for k-1; 

10 means to form p k for 0<k<m+n-l, to form P for X and Y both 
in two's complement mode, 

by adding to the sum for each k for 0£i£m-2 and 0<j<n- 
2 said terms a, bj, a^, b n . lf A f b n . 1f a^.-, Bj, and a m-1 b n . 
i, wherein the sum of the subscripts within each term 
15 is equal to k, 

by adding said terms a m . 1 and b n . 1 to the sum for 
k=m+n-2 , 

by adding said terms B n . 1f and 1 to the sum for 

k=m+n-l, and 
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by adding carries from the sum for each k-1 to the sum 
for each k; 

means to form p k for 0<k£m+n-l, to form P for X in two's 
complement mode and Y in unsigned mode, 

5 by adding to the sum for each k for 0£i£m-2 and 0<j<n- 

2 said terms a, bj , a..,, a f b n . v a.., Bj , and V , 
wherein the sum of the subscripts within each term is 
equal to k, 

by adding said terms a,,., and 1 to the sum for k=m+n-2, 

10 b Y adding said terms A.., and 1 to the sum for k=m+n- 

1, and 

by adding carries from the sum for each k-l to the sum 
for each k; 

means to form p k for OSkSm+n-l, to form P for X in unsigned 
15 mode and X in two's complement mode, 

by adding to the sum for each k for 0<i£m-2 and osjsn- 
2 said terms a, bj , b n .„ a, b n .„ a,,, bj , and Vl B n .„ 
wherein the sum of the subscripts within each term is 
equal to k, 



20 



by adding said terms b n . t and 1 to the sum for k=m+n-2, 
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by adding said terms and 1 to the sum for k=m+n- 
1; and 



by adding carries from the sum for each k-1 to the sum 
for each k. 



5 15. Apparatus to obtain a product P for use in a digital 

processing system, wherein P is one of the group P tc =X tc Y tc/ 

p unG =x t« Y Kwm* P A =x tc Y i«' and p b =x «« Y tc wherein X is 
representable as vector (a m . lf a m . 2 , . • . , a 0 ) and Y as vector 
(b nM ,b n _ 2 ,... r b 0 ), comprising: 



10 first means, to form 

M = EE a f bj 2 Hj for i from 0 to m-2 and j from 0 to 
n-2, 

N — S a f b n . n 2 n " 1+f for i from 0 to m-2, and 

15 O = £ a^ bj 2 nM+j for j from 0 to n-2; 



second means, to form P„, P tc , P A , and P B , wherein P^ is 
formed by summing L, M, N, and O, P tc is formed by summing 
L and M and subtracting N and O, P A is formed by summing M 
and N and subtracting L and O, and P B is formed by summing 
20 M and O and subtracting L and N; and 
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third means, to select one of the group V m t P tc , P A , and P B , 
to provide as P. 



16. The apparatus of Claim 15, which further comprises 
means within each of said L, N, and 0 means, to form 
5 compensations N^, and O^,., respectively, so that any 

subtraction of L, K, or O within said second means is 
obtained by a summation of L ac , N^, or O^, respectively. 



17. Apparatus to obtain a product P for use in a digital 
processing system, wherein P is one of the group P tc =*X tc Y tc , 

10 V^w Y un»' V*t* Y «r and *e =x w Y tc wherein X is 

representable as vector (a^^a^, . . . ,a 0 ) and Y as vector 
(frn-i' to n-2'-«*' b o>' comprising: 

first means, to form 

15 M = 2S a f bj 2 HJ for i from 0 to m-2 and j from 0 to 

n-2, 

N = S a, b n _, 2 n " ui for i from 0 to m-2, and 
o = 2 a^ bj 2 mM+i for j from 0 to n-2; 

compensation means, to form one or more compensations L^, 
20 N 2rc and °2rc * n the event that one or more of L, N, and O, 
respectively, are negative; and 



WO 91/19249 PCT/US91/00823 

30 

second means, to form P of one of the group P^, P tc , P A , and 
P B , wherein P^ is formed by summing L, M, N, and O, P tc is 
formed by summing L, M, N^, and O ac/ P A is formed by 
summing I^, M, N, and O^, and P B is formed by summing I^., 
5 M, N-^, and O; and 

third means to select one of the group P m , P te , p a , and P B , 
to provide as P. 

18. The apparatus of Claim 17, which further comprises 
accumulation means, to accumulate P with an accumulation 

10 term Z. 

19. The apparatus of Claims 1, 2, 3, 4, 5, 7, 8, 14, 15 or 
17, wherein at least one of said substantially identical 3- 
bit adders includes: 

a sum generator, comprising: 

15 first and second XOR means; 

first and second inputs of said first XOR means being 
differentially responsive to first and second bits, 
respectively, and first and second inputs of said 
second XOR means being differentially responsive to an 
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output of said first XOR m ans, and a third bit, 
respectively ; 

wherein an output of said second XOR means is able to 
provide a differential output for the sum generator, 
5 having a first state when an odd subset of said first, 

second, and third bits are true, and having a second 
state when an even subset of said first, second, and 
third bits are true; 

wherein at least one of said first and second XOR 
10 means comprises: 

first, second, third, and fourth tristate means; 

an input of each of said tristate means being 
inputs W, X, Y, and Z, respectively, of the XOR 
means ; 

15 an output of each of said first and second 

tristate means being connected together to form 
a first XOR means output; 



20 



an output of each of said third and fourth 
tristate means being connected together to form 
a second XOR means output; and 
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said first and third tristate means being 
tristated when an enable input is in a first 
state, and said second and fourth tristate means 
being tristated when said enable input is in a 
second state; 

wherein said inputs W and Z are responsive 
together and said inputs X and Y are responsive 
together, said inputs W and X are differential- 
ly responsive to a bit A, said enable input is 
10 differentially responsive to a bit B, and said 

first and second logic circuit outputs are able 
to differentially provide an XOR function of said 
bits A and B; 

a carry-out generator, comprising: 

15 a first input, responsive to said output of said first 

XOR means; 

a second input, differentially responsive to one of 
said first and second bits; 

a third input, differentially responsive to said third 
20 bit; and 
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logic means, able to provide a first differential 
state when at least two of said first, second, and 
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third bits are true, and .otherwise able to provide a 
second state; 



wherein said sum generator and said carry-out 
generator are able to provide a sum bit and a carry- 
5 out bit, respectively, on said first, second, and 

third bits; 

wherein said logic means comprises: 

first, second, third, and fourth tristate means; 

an input of each of said tristate means being 
10 inputs W, X, Y, and Z, respectively, of the logic 

means ; 

an output of each of said first and second 
tristate means being connected together to form 
a first logic means output; 

15 an output of each of said third and fourth 

tristate means being connected together to form 
a second logic means output; and 



20 



said first and third tristate means being 
tristated when an enable input is in a first 
state, and said second and fourth tristate means 
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being tristated when said enable input is in a 
second state; 

wherein said enable input is said first input, 
said inputs X and Z form said second input, and 
5 said inputs W and Y form said third input; 

wherein said sum generator and said carry-out generator are 
able to provide a sum bit and a carry-out bit, 
respectively, on said first, second, and third bits. 

20. The apparatus of Claims 5, 7, 8, 9, or 15, which 
10 further comprises accumulation means, to accumulate P with 

an accumulation term Z. 

21. The apparatus of Claims 5, 7, 8, 9, 15, or 17, wherein 
all summations are done in an interconnected array of 
substantially identical adders. 

15 22. The apparatus of Claims 5, 7, 8, 9, 15, or 17, wherein 

said substantially identical adders comprise 3-bit adders. 



23. The apparatus of Claims 5, 7, 8, 9, 15 or 17, wherein 
all summations are done using a Wallace tree technique. 
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24. A multiplier-accumulator array, 
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having adders 



comprising: 

a sum generator, comprising: 

first and second XOR means; 

5 first and second inputs of said first XOR means being 

differentially responsive to first and second bits, 
respectively, and first and second inputs of said 
second XOR means being differentially responsive to an 
output of said first XOR means, and a third bit, 
10 respectively; 

wherein an output of said second XOR means is able to 
provide a differential output for the sum generator, 
having a first state when an odd subset of said first, 
second, and third bits are true, and having a second 
15 state when an even subset of said first, second, and 

third bits are true; 

wherein at least one of said first and second XOR 
means comprises: 



first, second, third, and fourth tristate means; 
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an input of each of said tristate means being 
inputs W, X, Y, and Z, respectively, of the XOR 
means ; 



an output of each of said first and second 
5 tristate means being connected together to form 

a first XOR means output; 



an output of each of said third and fourth 
tristate means being connected together to form 
a second XOR means output; and 

10 said first and third tristate means being 

tristated when an enable input is in a first 
state f and said second and fourth tristate means 
being tristated when said enable input is in a 
second state; 



15 wherein said inputs W and Z are responsive 

together and said inputs X and Y are responsive 
together, said inputs W and X are differential- 
ly responsive to a bit A, said enable input is 
differentially responsive to a bit B, and said 

20 first and second logic circuit outputs are able 

to differentially provide an XOR function of said 
bits A and B; 



a carry-out generator, comprising: 
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a first input, resp nsive to said output of said first 
XOR means; 

a second input, differentially responsive to one of 
said first and second bits; 

5 a third input, differentially responsive to said third 

bit ; and 

logic means, able to provide a first differential 
state when at least two of said first, second, and 
third bits are true, and otherwise able to provide a 
10 second state; 

wherein said sum generator and said carry-out 
generator are able to provide a sum bit and a carry- 
out bit, respectively, on said first, second, and 
third bits; 

L5 wherein said logic means comprises: 

first, second, third, and fourth tristate means; 

an input of each of said tristate means being 
inputs W, X, Y, and 2, respectively, of the logic 
means ; 
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an output of each of said first and second 
tristate means being connected together to form 
a first logic means output; 

an output of each of said third and fourth 
5 tristate means being connected together to form 

a second logic means output; and 
said first and third tristate means being 
tristated when an enable input is in a first 
state , and said second and fourth tristate means 
10 being tristated when said enable input is in a 

second state; 

wherein said enable input is said first input, 
said inputs X and Z form said second input, and 
said inputs W and Y form said third input; 

15 wherein said sum generator and said carry-out generator are 

able to provide a sum bit and a carry-out bit, 
respectively, on said first, second, and third bits. 

25. The apparatus of Claims 5, 7, 8, 9, 15, 17, or 24, 
wherein said adders are arranged in a Wallace tree. 



20 



26. A machine method to obtain a product P from terms X 
and Y and accumulate it with a term Z, the result for use 
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in a digital processing system, wherein P is on of the 

group P tc =X tc Y tc , P^ihb P A =X tc Y ww' and P B =X ins Y tc 

wherein X is representable as vector (a m . 1 , a^. 2f • . . f a 0 ) and Y 
as vector , b n _ 2 , . . . ,b 0 ) , comprising the steps of: 

5 forming L - a^ b n _, 2"* n * 2 ; 

forming M = EE a { b. 2 Uj for i from 0 to m-2 and j from 0 to 
n-2 ; 

forming N = E a, b n . 1 2 n " u< for i from 0 to m-2 ; 

forming O » E a^ bj 2 m " uj for j from 0 to n-2; 

10 forming one or more compensations L 2RCf N^, and O ac , in the 

event that one or more of L, N, and O, respectively, are 
negative; 

forming P of one of the group P^, P te , P A , and P B , wherein 
P^ is formed by summing L, M, N, and O, P te is formed by 
15 summing L, M, N 2RC/ and O^, P A is formed by summing 1^., M, 

N, and O^, and P B is formed by summing L^, M, K ac , and O; 
and 



accumulating P with Z. 
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27. A machine method to form a product P of 
X^a^a^, . . . r a 0 ) and *=(*>„.,, b n . 2 , . . . ,b 0 ) , and accumulate P 
with a term Z, wherein the result is for use in a digital 
processing system, and wherein each of X and Y may be in 
5 two^ complement mode or in unsigned mode, comprising the 
steps of: 

forming terms a f bj for all i and j yielding positive 
values ; 

forming a combination of positive terms equivalent to any 
10 combination of terms a f bj which is negative; 

selectively combining said terms and said combination of 
positive terms to form P; and* 

accumulating P with Z. 



28. A machine method to form a product ^(P^n-i'IVn-a' • • • #Po) 
15 of X=(a m . 1 ,a m . 2f . . . f a 0 ) and Y=(b n _ lf b n _ 2 , - . . ,b 0 ) , and accumulate 

P with a term Z, wherein the result is for use in a digital 
processing system, and wherein each of X and Y may be in 
two's complement mode or in unsigned mode, comprising the 
steps of: 

20 providing complement terms A i and Bj, respectively, for 

0£i<m-l and 0<j<n-l; 
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providing terms 3^., b n . v a, h }i a f b n . v A f b n _ 1f a^., b } , and a„„ 
1 B jf for 0<i<m-2 and 0<j<n-2; 

providing a term which is the complement of said term 

a m-1 b n-1? 

5 providing at least one term having the constant value 
1; and 

selectively combining said terms so formed, including: 

forming p k for 0£k£m+n~2, to form P for X and Y both in 
unsigned mode, by summing said terms a- bj where i+j=k, 
10 and carries from the sum for k-1; 

forming p k for 0<k<m+n-l, to form P for x and Y both in 
two's complement mode, 

by adding to the sum for each k for 0Si^m-2 and 
0<j£n-2 said terms a f b } , a^, b n . f/ A f b n-1 , a^ B } , 
15 and a^! b n . 1 , wherein the sum of the subscripts 

within each term is equal to k, 

by adding said terms a^ and b^ to the sum for 
k=m+n-2 , 
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by adding said terms A ni . 1 , B n . 1# and 1 to the sum 
for k=m+n-l, and 
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by adding carries from the sum for each k-1 to 
the sum for each k; 



forming p k for 0£k<m+n-l, to form V for X in two's 
complement mode and Y in unsigned mode, 

5 by adding to the sum for each k for 0<i<m-2 and 

0£j£n-2 said terms a, b j# a,^, a f b n . 1# a m _., B jf and 
Vi B n-1' wherein the sum of the subscripts within 
each term is equal to k, 

by adding said terms a^ and 1 to the sum for 
10 k=m+n-2 , 

by adding said terms Vi and 1 to the sum for 
k=m+n-l, and 

by adding carries from the sum for each k-1 to 
the sum for each k; 

15 forming p k for 0<k<m+n-l, to form P for X in unsigned 

mode and Y in two's complement mode, 

by adding to the sum for each k for 0£i<m-2 and 
0<j£n-2 said terms a f bj, b n-1 , A t b n . v a^, b Jr and 
Vi B n-i' therein the sum of the subscripts within 
20 each term is equal to k f 
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by adding said -terms and 1 to the sum for 
k=m+n-2 , 



by adding said terms B n _-, and 1 to the sum for 
k=m+n-l , and 

by adding carries from the sum for each k-1 to 
the sum for each k; and 



accumulating P with 2. 



29. A machine method to obtain a product P and accumulate 
it with a term Z, the result for use in a digital 
10 processing system, wherein P is one of the group P tc =X tc Y tc/ 

Y «»' * A =x tc Y i*m> and Y tc wherein X is 

representable as vector (a B . 1f a Rl .2# • • • * a o) and Y as vector 
(b n . 1# b n . 2 , . . . ,b 0 ) , comprising the steps of: 

forming 

15 L = Vl b^ 2™-*, 

M = ZS a f bj 2 i+J for i from 0 to m-2 and j from 0 to 
n-2, 

N = S a f b n _, 2 n " VM for i from 0 to m-2, and 
O = S a^ bj 2 m * HJ for j from 0 to n-2; 
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forming one or more compensations X^c' K 2rc and °2rc *- n 
event that one or more of L, N, and O, respectively, are 
negative ; and 

forming P of one of the group P^, P tc , P A , and P B , wherein 
5 P OT is formed by summing L, M, N, and O, P^ is formed by 

summing L, M, N^, and o^, P A is formed by summing I*2r C , M, 
N, and 0^, and P B is formed by summing L^, M, K^, and O; 

selecting one of the group P^, P tc , P A , and P B , to provide 
as P; and 



10 accumulating P with 2, 
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FIG. 11C 19/36 A = a B = b C = c 
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